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Abstract: During the last decade, all commercial database systems have 
included features for parallel processing into their products. This 
development has been driven by the fact that databases grow in size at 
considerable rates. According to the results of the 1998 v very large 
database contest 1 the world's largest databases, which have reached a size 
of over 10TB, double in size every year. At that speed, they outgrow the 
increase in processor speed and memory size, so additional measures are 
required to accommodate the effects of rapidly growing volumes of data. 
Parallelism is one of those options. It helps to keep processing times 
constant, even if the size of the database increases . That effect, 
which is often referred to as s scaleup' is important for loading, index 
creation, all kinds of administrative operations on the database, and of 
course for long batch-type applications. Parallelism is also employed to 
speed-up queries that otherwise would take days or weeks to process and 
thus would be useless for the application. This type of requirement: fast 
results of complex queries on large data sets is characteristic of decision 
support applications . In this overview we will explain how parallelism in 
databases can help to solve such problems. (Author abstract) 19 Refs. 
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Abstract: Modern document management systems administer an increasing 
amount of data. Unfortunately, traditional database technology cannot 
satisfy the resulting performance requirements. In this paper we want to 
report on how parallel database technology can contribute to make large and 
scalable document management systems work. Our discussion clearly shows 
that simply substituting a sequential database system by a parallel one 
does not necessarily lead to scalability and high performance. In contrast, 
detailed knowledge on the application and document management system is 
necessary to decide on the optimal architecture of the parallel database 
system, its query optimization and parallelization strategies, as well as 
on its data fragmentation scheme and load balancing properties. All 
discussions are exemplified referring to an existing document management 
system developed at our university that is currently migrated from a 
sequential database system to a parallel one. (Author abstract) 19 Refs. 
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Abstract: We have developed a genomic database system on a network of 
workstations. The efficient access to genomic databases is crucial, given 
enormous increase in sequence data. By using a logic programming 
language, the system allows a user to perform adaptable data retrieval to 
integrated data objects in a single declarative framework. In addition by 
utilizing data-parallel processing, it provides efficient access in a large 
amount of genomic data on distributed computing environment. We present the 
implementation of the database system and its performance by measuring 
processing times for querying molecular sequence data. (Author abstract) 12 
Refs . 
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Abstract: We develop a reliable disk array based storage architecture for 
digital video retrieval . Our goals are two-fold: maximizing the number of 

concurrent real-time sessions while minimizing the buffering 
requirements, and ensuring a high degree of reliability. The first goal is 
achieved by adopting a pipelined approach and by reducing latencies through 
specialized disk caching and constrained data placement schemes. The second 
goal is achieved by dividing the disks into RAID 3 reliability groups which 
serve as pipeline stages. We note that the buffering requirement decreases 
as the number of groups increases . To improve the performance further, 
we introduce two techniques for more efficient movie retrieval: on arrival 
caching, and interleaved annular layout. We present a case study of the 
performance of these techniques which shows a significant improvement when 
they are incorporated. (Author abstract) 9 Refs. 
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Abstract: With the advent of inexpensive microprocessors and high 
bandwidth interconnects, coupling a large number of processors to form a 
highly parallel system has become an increasingly popular method for 
improving the cost-performance ratio of computer systems. Recent work has 
shown that this method is also applicable to database systems with 
increasing benefits as the queries become larger and more complex. The 
objective of this paper is to examine the various issues encountered in 
parallel query processing as well as the techniques that are available 
for addressing these issues. (0 Refs) 
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Abstract: A word-based text document retrieval system is being 
implemented by AMT (Active Memory Technology) on the AMT DAP series of 
massively parallel computers. For a 20 G Byte database of 10 million 
documents a system with 2 DAP 610s can sustain around 70 Boolean queries 
per second. Any of these parameters can be extended. In the initial 
application the addition of two DAP 610s to a cluster of four large VAXes 
will increase the performance by several hundred times. (10 Refs) 
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When parallel lines meet — Only parallel databases wring the best 
performance out of parallel-processing systems 
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Discusses how parallel databases bring out the best performance from 
parallel processing systems. Illustrates how partitioning and pipelining 
increase database performance in a sample sequential query. Says that 
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types of multiprocessor architectures include shared-everything, 
shared-disk and shared-nothing architectures. Says that there are also two 
kinds of parallel database architectures, shared-nothing and shared-disk, 
which were intended to address issues in shared-nothing hardware designs. 
Concludes tha the use of parallel queries offer the most effective 
approach to improve query -processing performance. Includes a sideba 
^Partitioning Data 11 about the basic methods of partitioning data Includes 
three diagrams, two graphs and one flow chart. 
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Languages : English 

Document Type: Product Announcement 
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Reports that Oracle Corp. of Redwood Shores, CA, will release Oracle 7.1 
($NA) . Says it will offer user-extensible SQL with application-specific 
data types and function, a primary site replication, and a GUI-based server 
management product. Says a Parallel Query Option will also be available 
for Sun Microsystems Inc. and Sequent Computer Systems Inc. symmetric 
multiprocessing servers, and will ship on 12 additional parallel processing 
s soon. Adds that Oracle's synchronous replication feature, whi allows 
multiple servers to gather and replicate data, has entered beta testing, 
(cr) 
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ABSTRACT : 

The authors propose two concurrency control algorithms for distributed 
database systems. Both algorithms employ a token passing mechanism, in 
which the path of the token is determined dynamically in order to maximize 
the system's throughput. The two algorithms differ in terms of selecting 
the next holder of the token in case the system becomes empty. One scheme, 
the Dynamic-Ring Algorithm (DRA) switches to a virtual ring algorithm. The 
other, the Dynamic-Status Algorithm (DSA) employs a query based mechanism. 
Simulation study reveals that both the proposed algorithms perform 
considerably better than the conventional ring algorithm, under low to 
medium loading conditions in the network. The difference in performance 
becomes more prominent as the number of database sites increases . 
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ABSTRACT : 

To fit the needs of nowadays complex database applications it is 
necessary to increase the performance of relational database systems 
dramatically. This challenge can only be achieved by use of massive 
parallel systems. In this paper we want to show an methodical way of 
parallelizing an existing database system for a general-purpose 
multiprocessor. Four approaches of different granularity, i.e. kernel 
replication, module splitting, query processing, and operator parallelism, 
are discussed and their application to the database system TransBase is 
shown . 



